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DISTRIBUTION OF SQUARES MODULO A COMPOSITE NUMBER 

FARZAD ARYAN 


Abstract. In this paper we study the distribution of squares modulo a square-free number 
q. We also look at inverse questions for the large sieve in the distribution aspect and we 
make improvements on existing results on the distribution of s-tuples of reduced residues. 


1. Introduction 

In this paper we are mainly concerned with the distribntion of subsets of integers that 
are not additively structured, though we will also prove results for sets that are additively 
structured. We begin by studying squares, which is the model example of a non-additively 
structured set. We continue with more complicated non-additively structured sets. The hnal 
part will be the study of the higher central moments of s-tuples of reduced residues. 

The distribution of squares modulo q. For q square-free, we call an integer s a square 
modulo q when s is a square modulo p for all primes p dividing q. Note that we count 0 as 
a square. Several authors have studied the distribution of spacings between squares modulo 
q. For q prime, a theorem of Davenport [6] shows that the probability of two consecutive 
squares modulo q being spaced h units apart is asymptotically 2“^ as q tends to infinity. 
For q square-free, Kurlberg and Rudnick [17] have shown that the distribution of spacings 
between squares approaches a Poisson distribution as u{q) tends to inhnity, where u{q) is 
the number of distinct prime divisors of q. 

Theorem (Kurlberg and Rudnick). Let □ be the symbol that denotes the word square and 

q 

s — - 

: X zs □ modulo q} ’ 

and let I be an interval in M that does not contain zero. Then 

■■ xi-X 2 ^ si ■. Xi,X 2 are U mod q} _ ^ 1 x 

: a; zs a □ modulo q} 

Note that s is the mean spacing in the set of squares modulo q and the “probability” of a 
random integer being a square modulo q is 1/s, which approximately is l/2“'('^h The results 
we prove in this paper are, more or less, in the spirit of papers written by Montgomery 
and Vaughan [19] and Hooley [14, 15, 16]. These articles answer Erdos’ question in [7] 
regarding the gaps between consecutive reduced residues. The reduced residues modulo q 
are the integers Oj, 1 = Oi < 02 < ... < 0^(5) < q, that are relatively prime to q. Erdos [7] 
proposed the following conjecture for the second moment of the gap between consecutive 
reduced residues: For A = 2 we have 

</'(?) 

^A(g) = ^ qP^~^, 

i=l 


1 


( 1 , 2 ) 





where P = 4>{q)/q is the “probability” that a randomly chosen integer is relatively prime to 
q. Hooley [14] showed that (1.2) holds for all 0 < A < 2. For A = 2, Hausman and Shapiro 
[11] gave a weaker bonnd than (1.2). Finally, Montgomery and Vanghan [19] sncceeded in 
proving the conjectnre, showing that ( 1 . 2 ) holds for all A > 0. The key ingredient in the 
proof of the results of [14] and [11] is the variance of the random variable 

P-hin) = #{m G [n,n + h] : m is a reduced residue modulo q}. 

In [19] Variance and also higher central moments of Ph were studied. Motivated by the 
above results, we consider the variance of the following random variable. Let n be an integer 
chosen uniformly at random in {1, 2 ,..., g}, and dehne Xh by 

Xh{n) = ij^{s G [n, n + h] : s is a □ modulo q}. 


Theorem 1.1. Let q be a square-free number and P = 4>{q)/q. Then as an upper bound we 
have 

l^V A . h V _ h 
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, m=l 

n+m is □ mod q 
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2^{q) p 


and as a lower bound we have 
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n+m is □ mod q 


H('T> 

(1.3) 

h y 

(1.4) 

2i^(q )p I 


Moreover, if the prime divisors of q are all congruent to 3 modulo 4 then we have the sharper 
bound 

A h h 

^ ^U){ci) p I — 709 * V^*^) 


^n=0 


, m=l 

n+m is □ mod q 


2 “('?) p 2 • 


Note that in (1.3), np|q(l +P ^ 2V^^ which is much smaller than for large a;(g). 
For the mean of Xh, we have E(T’/i) = and therefore the left hand side of (1.3) is equal 

to the variance of +, which we denote by Var(+). Consequently, Theorem 1.1 implies the 
following upper bound: 

Var(+) + ■E(+), 


p\q 


Vp 


whereas the trivial upper bound is 


Var(T;,) < E(+ 


Remark 1.1. Let N be the set of quadratic non-residues modulo a prime p. The reason 
for the better bound (1.5) is that, when p = 3 mod 4, the size of the Fourier coefficient 
J2n&N e(n/p) of N, is smaller than when p = 1 mod 4. 

In 1936 Cramer [4], assuming the Riemann hypothesis (RH), showed the following result 
concerning the average gap between consecutive primes: 

E ~ Pnf < a;(loga;)^+L ( 1 . 6 ) 

Pn<X 

This bound was the inspiration of Erdos’ conjecture (1.2). Using Theorem 1.1, we prove an 
analogous result for gaps between squares. 
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(1.7) 


Corollary 1.1. Let Si be the squares modulo q in increasing order. Then 

i y (sm - s.)"« 2 “<«>p(iog,) n (i+4)- 

d Si<q p|g \ VL'' 

Remark 1.2. It seems plausible that the factor np|(j(l + can be removed from the right 

hand side of (1.7). Also, it seems difficult to estimate the higher moments in Corollary 0.1. 
Indeed, in the simple case where q equals a prime number p, a good estimation of the higher 
moments would imply that the gap between two consecutive quadratic residues is less than 
po(i)_ Note that the best known bound obtained by Burgess [3] is 


An important property of the squares that we use in the proof of Theorem 1.1 is the following: 
For a 4 0 modulo p we have 


sa\ 
p ) 


« V?' 


( 1 , 8 ) 


E e, 

s is □ mod p 

In the language of Fourier Analysis, this property means that all of the non-trivial Fourier 
coefficients of the set of squares have square root cancellation. In the context of this paper 
we denote the property of having small Fourier coefficient as being “non-additively struc¬ 
tured”. In the next section we generalize Theorem 1.1 for all the sets that are not additively 
structured. We also use similar ideas to study a problem related to additive combinatorics 
which is known as the inverse conjecture for the large sieve. 


Relation with the inverse conjecture for the large sieve. In this section we consider 
the inverse conjecture for the large sieve. We also introduce the notions of “additively struc¬ 
tured” and “non-additively structured” sets and study the distribution of these sets. Based 
on these ideas we formulate a rehned version of the inverse conjecture. Roughly speaking, 
we say that a subset of Z/pZ is not additively structured if all of its non-trivial Fourier coef- 
hcients have square root cancellation. On the other hand, being additively structured means 
there exist at least one large Fourier coefficient. Having a large Fourier coefficient is equiva¬ 
lent to saying that the set has many quadruples {xi,X 2 , x^, x^) such that xi + X 2 = x^ + x^, 
which explains the reason for choosing the “additive structure” terminology. 


Let A be a hnite set of integers with the property that the reduced set A (mod p) oc¬ 
cupies at most {p + l)/2 residue classes modulo p for every prime p\q. In other words, for p\q 
and Qp C Z/pZ with |r2p| = (p — l)/2, A is obtained by sieving [1,X] by all the congruence 
classes in Qp. The inverse problem for the large sieve is concerned with the size of A (see 
[13]). In the case where q is equal to the product of all primes less than y/X, using the 
large sieve inequality one can show that |A| <C y/X. The following is the formulation of the 
conjecture by Green [8]. 

Conjecture (Inverse conjecture for the large sieve). For every prime numberp < yfX, 
let kip C Z/pZ with |Hp| = (p — l)/2. Let A C {1,2,... ,X} be the set obtained by sieving 
out the residue classes in kip for p < X. Then |A| <C unless A is contained in the set of 
values of a quadratic polynomial f{n) = an^ + bn + c, with the possible exception of a set of 
size 

Remark 1.3. This has been stated informally in the literature as follows. If the size of A 
is not too small then A should possesses an “algebraic” structure. The problem with this 
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statement is that a formal definition for possessing an “algebraic” structure has not been 
given. Although it seems that any set with “algebraic” structure is not additively structured, 
the reverse may not be true. 


Here our aim is to look at this problem from the distributional aspect. We consider A 
to be a subset of an interval larger than the interval [1,W]. We hx A to be a subset of 
{1,2,..., q}. This set shall be dehned by sieving out congruence classes in flp for all p\q, 
l^pl = {p ~ l)/2- Next we let n be an integer picked uniformly at random in (1, 2,..., q}, 
and dehne the random variable yh by 


yhin) = \[n,n +h]r\A\. (1.9) 

Since |flp| = {p — l)/2, the Chinese Remainder Theorem implies that 



Question; How is A distributed modulo g? 


We will prove a result which shows that if for all p\q, Qp is not additively structured, then A 
is well distributed. In the other direction we show some partial results in the case that Qp is 
additively structured. The latter result indicates that A is far from being well distributed. 
To make the notion of being well distributed more clear in the context of this paper, let 
A C [1, g] and dehne Prob(a; E A) = |A|/g. We say that A is well distributed if any interval 
of length h inside [1, g], contains h^(l + o(l)) elements of A. 


Now we introduce the notion of a set that is “not additively structured”. We describe this 
using the example of squares. In this case Qp is the set of non-quadratic residues modulo p. 
In other words in order to end up with squares after sieving, we need to sieve out integers 
congruent to non-quadratic residues modulo each prime p\q. Inspired by the property of 
squares mentioned in the Equation (1.8), we have the following dehnition. 


Definition 1.1 (Not additively structured). For p a prime number we say that Qp C 
Z/pZ is not additively struetured if for all a ^ 0 modulo p, 


xL t i ip 




( 1 , 10 ) 


where Cp depends on p and satisfies Cp logp. 

We will give two examples of sets that are not additively structured. 


Example 1. By using the following theorem of Weil we can show that the image of a poly¬ 
nomial P is not additively structured under the following condition: For every y G Im(P) 
the equation P{x) = y (mod p) has the same number of solutions, with the exception of a 
subset S of the image with \£ I < Vp- 

Theorem (Weil). Let P G Z[X] he a polynomial of degree d > 1. Let p be a prime such that 
gcd(d,p) = 1. Then we have 


x[mod p) ^ 


( 1 . 11 ) 
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Example 2. Another example of a set that is not additively structured is 

^K,p '■= {x + y ■. 1 < x,y < p — 1 and xy = 1 mod p). ( 1 - 12 ) 

This can be shown by using the Weil bound on Kloosterman sums. The size of ^k,p is 
(p + l)/ 2 . One open problem regarding this asks about existence of small residue classes 
with small reciprocal. More precisely let 

Mj, ■= min < maxja:, L (1T3) 

^ x^O mod p t J J ^ ^ 

Then the question is how small can Mp be? As an application of the Weil bound on Kloost¬ 
erman sums one can show that Mp < 2(logp)p^/^ (see [12]). It seems natural to conjecture 
that Mp < In fact Tao [20] even suggested that Mp = might be possible. 

Using Theorem 1.2 one can show that there exist x modulo p such that 

X + x~^ mod p < . (1-14) 

For such X if x+i,x~^ < p, then (1.14) would imply the conjectural bound for Mp. However if 
xd-zx~^ > p, then (1.14) does not give any useful information. Thus it would be interesting 
to look at the distribution of the set 

^'k,p •= : 1 < a:, x~^ < p — 1 and x +1 x~^ < p|. (1-15) 

If ^'k,p were not additively structured then it would imply the conjectural bound for Mp. 
However in Theorem 1.4, we will show that this is not the case and is not well dis¬ 
tributed modulo p. Consequently one way to attack the conjectural bound on Mp would be 
to find a proper subset of ^k,p which is not additively structured. Another way would be to 
add certain elements to ^k,p order to make a set that is not additively structured. 


We show that if Qp is not additively structured then A is well distributed. 


Theorem 1 . 2 . Let he as (1.9). Then if Lip is not additively structured i.e., satisfies 
( 1 . 10 ) and Iflpl = (p — l)/ 2 , then we have 


1 

Q 


E 


71=0 



m^flp mod p 

Vp|<? 



(1,16) 


or equivalently 

Var(31J«E(y»)n(^^ + l^), (1,17) 

where Cp is the constant in (1.10). 


Remark 1.4. Note that Cp can never be too small. In fact one can get a lower bound Cp > 1/2. 
As a result the right hand side of (1.16) is always bigger than 


Remark 1.5. Note that the trivial upper bound on Var(3^/i) is E(A’/i)^. In section 3 we prove 
a more general result without the restriction |r2p| = (p — l)/2 (see Lemma 4.1). 


Remark 1.6. By taking Lip equal to the set of quadratic non-residues in Theorem 1.2, we 
obtain Theorem 1.1. 
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Returning to the inverse conjecture for the large sieve, Green and Harper [9] proved the 
conjecture when Qp is an interval and gave a non-trivial result when Qp has certain additive 
structure. This brings us to the dehnition of a set with additive structure. 


Definition 1.2 (Additively structured). For p a prime number we say that Qp C Z/pZ 
is additively structured if there exist a ^ 0 modulo p, 


I ax\ 

Edy) 




> Cpp, 


(1.18) 


where Cp depends on p and here we consider Cp 3> log ^ p. 


Remark 1.7. Note that additively structured is the extreme opposite of not additively struc¬ 
tured, since the opposite of not additively structured means every set has a Fourier coefficient 
just bigger than while being additively structured means there exists a Fourier coef¬ 

ficient bigger than p^“'^. 


Let Vtp = {0, 2,4,... ,p — 1}, for all p\q. 
for a = (p -|- l)/2 we have 


Note that this set is additively structured since 



For the set A we prove a result which shows that A is far from being well distributed. 


Theorem 1.3. Let Lip = 

where X <pi < 2X and 


1 

q 


E 


n=0 


{0, 2 ,4,... ,p — 1} and yh be as (1.9). Assume q = pi,... ,P[iogx:j 
\P 2 — Pi\ ^ log Pi. Then for every integer h < we have that 


m£[n,n+h] p\q \ ^ 

m^r2p mod p 

Vpk 



f—7 




or equivalently 


\av{yh) > E{yhf. 


Theorem 1.2 shows a connection between non-additive structure in sets Lip and well dis¬ 
tribution of A. Theorems 1.3 shows a connection between the additive structure of the sets 
Lip and A not being well distributed. Recall that A is obtained by sieving out the congru¬ 
ence classes in Lip. In the inverse conjecture for the large sieve, there is a similar connection 
between the size of the sifted set and the additive structure of Lip. More precisely, if the size 
of the sifted set A is not too small, then A is the image of a quadratic polynomial and from 
Example 1 we know that the image of a quadratic polynomial is not additively structured. 
Thus if the size of A is large then A is not additively structured. Inspired by this observation 
it seems natural to refine the inverse conjecture for the large sieve in terms of the additive 
structure of A. Now we state our conjecture. 


6 












Conjecture. Let A he the subset of[l,X] obtained by sieving out congruence classes in Qp 
for p < . Moreover assume that for each p, fip is additively structured i. e. Up has the 

property that there exist a ^ 0 modulo p such that 



with Cp S> log ^ p. Then 


|A| 


Harper and Green [9] proved a non-trivial bound for the size of A in the above conjecture. 
They proved that if Gp has many quadruples (xi, X 3 , X 4 ) such that xi + x^ = 2^3 + x^, then 
there exists c > 0 such that |H| Note that the quadruple condition is equivalent 

to VLp having a large Fourier coefficient. (Larger than 


To hnish this part of the article we state a result regarding the distribution of from 
Example 2. Note that 



if p = 3 mod 4, 
if p = 1 mod 4. 


(1.19) 


Theorem 1.4. Let be as in (1.15). Then for h < p/2 we have 


1 

p 


E 


n=0 


E 1 


^ m£[n,n-\-h] 
m£Q'r 


K,p 



> h^. 


( 1 . 20 ) 


In the last part of this article we study the distribution of s-tuples of reduced residues. Al¬ 
though the following theorem is independent than previous results, the techniques are very 
similar. In particular Lemma 2.1 will be applied in all theorems. 


Higher central moments for the distribution of s-tuples of reduced residues. Let 


T> = {hi,h2,...,h,}, 


and i'p{T>) be the number of distinct elements in T> mod p. We call T> admissible if Vp{T>) < p 
for all primes p. We call (a -|- hi,..., a -|- h^) an s-tuple of reduced residues if each element 
a + hi is coprime to q. In our previous results we were only able to calculate the variance and 
could not obtain any estimate for higher moments. The reason for this, in a sieve-theoretic 
language, is that when |f2p| = (p — l)/2, as q tends to inhnity the dimension of the sieve 
also tends to inhnity. However, if we hx our admissible set and look at the distribution of 
s-tuples of reduced residues, then the dimension stays bounded and consequently we are able 
to derive results for higher moments. Let kq{m) be the characteristic function of reduced 
residues, that is to say 


kq{m) 


1 if gcd(m, q) = 1, 
0 otherwise. 
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( 1 , 21 ) 


The generalization of Erdos’ conjecture, i.e. 

V2^{q)= E (a.+i - 

{ai+hj,q)=l 
hj&V 

concerns the gap between s-tuples of reduced residues. In order to prove the generalization 
of Erdos’ conjecture (see [7] and [1]), the author in [1] studied the /c-th moment of the 
distribution of the s-tuples of reduced residues: Let 

E ( E + m + hi)... kq{n + m + h^) - hY[ (l - 

n=0 \m=l p\q \ P 

In the case s = 1, i.e. V = {0}, and k < 2 this was studied by Hooley [14] who found an upper 
bound for h). Hausman and Shapiro [11] gave an exact formula for h). Their 

formula immediately gives the upper bound < qhP. Finally, for a fixed natural 

number k, Montgomery and Vaughan [19] showed 

Ml°\q, h) < q{hPf/‘^ + qhP. (1.22) 

For a fixed admissible set P it was proven in [1] that 

^^(g, h) (1.23) 



This was enough to get the generalization of Erdos’ conjecture, however the method failed 
to get bounds as strong as (1.22). In the last section of this paper we improve (1.23). 


Theorem 1.5. Let P 


and in general 


(j){q)/q- For h < exp ho^ve 




(1.24) 

(1.25) 


Remark 1.8. Note that (1.24) is the best possible upper bound and it matches the upper 
bound derived from probabilistic estimates (see [1, Lemma 2.1]). 


The open question that remains here is whether or not the bound (1.25) is sharp. In other 
words, is there an admissible set V such that for h > exp we have 

M^{q, h)»,,fc 


Notation. Throughout the paper we use the symbol □ as an abbreviation for the word 
“square”. For example, “a is a □ modulo q” reads “a is a square modulo g”. Also, for functions 
g{x) and h{x), we use interchangeably Landau’s and Vinogradov’s notation g{x) = 0{h{x)), 
g{x) <C h{x) or h{x) 3> g{x) to indicate that there exists a constant C > 0 such that 
\g{x)\ < C\h{x)\ for all x. We use subscripts such as <^s,k to indicate that the constant C 
may depend on parameters s, k. We let cf) denote the Euler’s totient function, defined by 
4>{q) = ^ n < q : {n,q) = 1}. We also write P = <i){q)/q and we let denote the 

addition in Z, as opposed to modular addition. 





2. Main estimate 


In this section we prove an exponential identity for the indicator function of s-tuples of 
reduced residues. 

Lemma 2.1. Let T> = {hi,hg} be an admissible set. For square-free integers q we have 

/i(r) 


kq{mhi)... kg{mhg) = e{m-]fiv{a,r), 

(Pvir) V rj 


(a,r)=l 


where 




p\r \ s&T>p 


\ p 


Mr) = l[{p-iyp{V)), 

p\r 


Pv = 


Uplq {P - MP) 


(r/p)p^ is the inverse ofr/p in [fLjpl^ , andVp consists of the reduction of elements ofV 
modulo p. 

Proof. The starting point in the method of Montgomery and Vaughan [19] is to use the 
following Fourier expansion of the indicator function of reduced residues: 

k,{m) = Y.— E elm-V 


r\q ^ 0 <b<r \ ^ / 


Using this expansion, we deduce that 

kq{m+hi) ... kq{m + hg) 


h'('i’i) • • • h'(u) I ^ 

= L E e L(m + h,)- 

ri,r2,.--,^s|g I*** s f)<ai<ri \ 2=1 ^ 

E s fl_a 

i = l r A r 


= E 


r\q 

a<r 

(a,r)=l 


a 

el m— 
r 


^ /i(ri) .../i(r,) E fthp 


ri,r2,...,rs\q 


ri... r 


^ 0 <ai<ri \ 2=1 


i=\ rj r 


We £x a, r and therefore it is enough to show that 




ri,r2,...,rs|(? 

= P 

J. ^ 


0 <ai<ri \ 2=1 


ri...rs 

E S O'i _a 
i=\ rj r 

_nf yJfLLLil 


Vi 


( 2 , 1 ) 


p\r \ sGT>p 

To show this, note that we can write 


p 


“ (modi) 


p|<? 


P 
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( 2 , 2 ) 


uniquely where 0 < Qp < p. Fixing po\r, we have that 


(niodi), 

r Po Po Po 

hence a = apA —) (mod po). Since g, and consequently r, are square-free, (—,Po) = 1, so 

V r)r, / \Po / 


/ r \ -1 

for a„o 7 ^ 0, we have that Opp = a( —) (modpo)- Using (2.2), we can can write the left 

^Pq'PO 

hand side of the (2.1) in terms of the prime divisors of q. Therefore (2.1) is equal to 


HE 

p\i <?iip 


/i(gi).. ./i(g,) 

qi...qs 


E 

0<aj<q'i 2=1 

Q-i V 


To simplify the condition X) ~ = we write 


0 <ai<qi i=l 

p 

Therefore (2.1) is equal to 




p=l d* 0 <ai<qi 


P 


i=l 


HE 

pk 9i|p 


/i(gi).. ./i(g.) 


qi---qs “iPo<“<q, 


E^ E de-E^ddEft.^ 


2 = 1 




P e 

HE- 

p|g p=l n 


HE— E 4-(h. 


*=i ijib 




0 <ai<qi 


qi 


P e 

=nE 


hd) 


p\q i;=l P i=l 


zJ2A-ihi-v)) 


P 


a<p 


p 


=n E 


A _ Mq)r TT pA) 


n 


p\q v=l P P s^h,raodp 


E. 


v^hi mod p 
hi 


hi GX? 
cip^O 


The last equality holds since Op = 0 for p f r, and for Op 7 ^ 0 we have that 


E d>'^) = -5:d‘ 


21=1 
v^hi mod p 
hi^V 


-sGX^t) 


p 


This completes the proof of the lemma. 


□ 


3. Distribution of squares modulo q 

In this section we are going to prove Theorem 1.2 and Corollary 1.1. Before proceeding 
with the proof we derive a formula for the left hand side of (1.3). For q square-free, x is a 
square modulo q if and only if x is a square modulo p for all primes p dividing q. For each p 
which divides q, let Vp := {hi p,..., hiy^^p}. By the Chinese Reminder Theorem there exists 
a set P = {hi,..., hs}, such that V modulo p is equal to Vp, for all p|g. For instance let 
hi, h 2 ,... hg to be uniquely selected to satisfy the following congruences hi = hi^p mod p, for 
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all p\q. In the case that i > Up and therefore hi^p does not exist, we take hi^p to be equal 
h^j,^p- This explains how we can construct the set V. 


Now if 


kq{m + hi) ... kq{m + hg) = 1, 


then m ^ —hi^p modulo p, for 1 < i < i^p and for all p dividing q. We now let 

T)p = {-ni,...,-np^}, (3.1) 

where n^s are quadratic non-residues modulo p. From kq{m + hi)... kq{m + hg) = 1, it 
follows that m is a square modulo q. Using Lemma 2.1 we have that 

kqim + hi)...kq{m + hg) ^ e(m^^pvia,r) 


pk 


r\q 


ripi 


..w.1^ I Irjlr* rj ntf '' ' ' 


r 2 CL<r 


2 aj(g) p TT . . 

^ r\q llp|r 2 a<r 

(a,r)=l 


(3,2) 


where P = Summing this from m = n-|-lton-|-h and then subtracting the term 

corresponding to r = 1 we have 

n+h 

^ kq{m + hi)...kq{m + hg)-^^^ 


m=Tl+l 


■E 


p(r) 


2 ui{q )P ^ n , 2±i - 

r\q ttp|r 2 
r>l K»')=l 




(3.3) 


where 


E{x) = ^ e{mx). 


m=l 


We square (3.3) and sum from n = 1 to g to obtain 

q-l / h 

EE kq{n + m + hi)... kq{n + mP hg) 
n=0 \m=l 


h 


2u}(q) p 


(3.4) 
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I.roullplri 2 Ilp|r2 9 ai<r / \ ^2 / 


4uj{q)p2 ^ rr P±irr P±1 ^ 

ri,r2\q^mTl 2 ^mr2 2 «i<?’ 


1 = 1,2 
ri r2 


Now we are prepared to prove the Theorem 1.1. 
Proof of Theorem 1.1. Fro: 
and 02 = r — oi, thus we have 


Proof of Theorem 1.1. From the condition — — G Z in (3.4) it follows that ri = r 2 


q-l 

E 

n=0 


E 


1 - 


h 


q 


E 


ri r 2 


E 


n+m is □ mod q r>l (^^ 5 ^)=! 


E[^)liv(a,r) 


(3.5) 
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Now, we need to bound //©(a, r). For each n* in Vp in (3.1), employing the Legendre symbol 


'-nia(r/p^ ' 


P 


hi 


(r/p) 


-1 


P 


Since a 7 ^ 0 the sequence | — nia(r/p^ | is either the sequence of quadratic residues or the 
sequence of quadratic non-residues modulo p. Using the Gauss bound for exponential sums 
over quadratic residues (respectively non-residues) [5, Page 13] 


nia{r/p)p 


-1 


P 


= if 


'—a[r/p^ 


P 


= -l, 


otherwise. 


if p = 1 modulo 4 and 


E' 


niairjp) 

P 


-1 


\/P + 1 


if p = 3 modulo 4. Consequently, for a 7 ^ 0, 


p\r p\r 


Using this in (3.5) we have the upper bound 


(3,6) 


(3.7) 


(3.8) 


q-l / h 

e( e 

71=0 

and the lower bound 


m=l 

n+m is □ mod q 


2 ^( 9 ) p 


< 


Q 


4 ^( 9 ) p 2 


En 

r\q p\r 
r>l 


f+iy 


{p+^y £ 

(a,r)=l 


E 


(3.9) 


Q 


p2 


En 

r\q p\r 
r>l 


p- 1 


ip+ 1)^ s 

(a,r-)=l 


i?(- 


2 q—1 

<E 

n=0 


' h 

E 

, m=l 

n+m is □ mod q 


2uj{q) p 


Using the bound ([19, Lemma 4]), 


(3,10) 


E 

a<r 

(a,r)=l 


< r min(r, h), 


and by employing this bound in (3.9) we have 

T \ 2 


q-l 

E 

n=0 


h 

E : 

, m=l 

n+m is □ mod q 


< 


2ui{q) p — /\pi{q) p 


1 ftjpa + HddyPzix 


vk 


p2 + 2p + 1 ' 2‘^(9)p 


ftfKi 


vk 


For the lower bound, let r > h?. Then we have 

(j){r)h<^ E 

a<r 

(a,r)=l 

12 


£ 1 “ 


(3.11) 


—) 
Vp 


2 



































Therefore, 


g , ^ 3 


9-1 


4 ‘^( 9 ) p 


h E n(i - ^)« E 

r>h^ p\t V-^ n—0 

r\q 


E 1- 


h 


, m=l 

n+m is □ mod g 


2aj(g) p 


□ 


Proof of Corollary 1.1. Let 


L{x) = #<1 f ; 1 < f < n - Si > a: 


Pl9 


Then 


poo 

(Si+i - SiY = 2 L{y)ydy. 

si<q 


(3.12) 


For 2 /< Moggnp|g(l + ;^) we bound (3.12) trivially. To bound L{y) we note that if 

Sj+i — Si > h, then 

V ^ h _ h 

2^ 2‘^(q)p 2‘^(i)p’ 

m=l 

72+m is n mod q 


for Si < n < Sj+i — h. Therefore we have 


Si+i—Si>h 


h \2 / 

«E 

72=0 \ m=l 

n+m is □ mod q 


2u){q) p 


h 

E 


2ui{q) p 


(3,13) 


Now if we take y = [h/2] then the left hand side of (3.13) is 

Thus, by employing Theorem 1.1 we get the following bound: 

2^1(9) p X 


Liv) < 


y 


n(i 

Pl9 


Vp 


Applying this bound in the integral in (3.12) and the fact that for y > q, L{y) = 0 completes 
the proof of the Corollary. □ 


4. The general case 

In this section we will prove Theorems 1.2, 1.3 and 1.4. Let Qp C "L/pL. We are interested 
in numbers less than q such that, modulo p, they do not occupy any congruence classes 
in Op, i.e. {m < q : m ^ Qp mod p}. By the Chinese Remainder Theorem there exist 
Y\p\q{'P~ l^(p)l) such numbers. A natural question is to ask about their distribution modulo 
q (see [10]). Lemma 2.1 shows the connection between the distribution of these numbers 
and the exponential sum over elements in Op. Let P = {hi,... ,hs} be a set such that 
Vp = {—a; : u G Op}. If kq{m + hi) ... kq{m + hg) = 1, then m is not congruent to any 
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member of Qp modulo p. Now we take a look at the distribution of these numbers. Observe 
that 

h 

kq{m + hi).. . kq{m + h^) 


m=l 


P l^pl 


/i(r) 


Plq P Up\r{p - \^p\) 

(a,r)=l 


^ ^ r /^25(a,r) 


By a calculation similar to (3.4) we have 


X! 1 X! + m + hi)... kq{n + m + h^) - h J| 


Ti=0 \m=l 


pi<? 


p 


= (ill 

p\q 


'p - 


E 


E 


P ) Yl np|r(p l^pl)^ 0<a<r 
r>l (a,r)=l 

In the next lemma we bound the variance. 


Fl-jp^(a,r) 


(4.1) 


(4.2) 


Lemma 4.1. Assume that for eachp\q, |r2p| = d^p with {p — |r2p| > p^/^+'^),and |/ix)(a,p)| < 
Cpy/p, where < 1. Then we have that 


£( i: i-'-nf’’ 

11=0 y mG[n,n+/i] p\q 

m^flp mod p 
Vp|(j 


p 


<qhYl[il-d^f + cl 

p\q \ J 


Proof. Using the assumptions in Lemma 4.1 and (3.11) we have 
E ( E + m + hi)... kq{n + m + h^) - h 

n=0 ym=l p\q 


P 




p\q 


P J r\q np|r(P “ dpPf 


r 

r>l 


pk 


r\q p\r 
r>l 


l-d 


< ghn(l-c;)0 1 + 


pk 




= qh\{{{l - dj,f + cl 

p\q \ / 


This completes the proof of the lemma. 


□ 


Proof of Theorem 1.2. This follows from Lemma 4.1 by taking = {p — l)/2p. Recall 
that c' = ^. □ 

p p 

Next we prove Theorem 1.3: 

Proof of Theorem 1.3. . Let V* = {hi,... ,hs} be an admissible set such that P* = 
-kip = {0, -2 ,..., -{p - 1 )}. Let ^ ^ for Op = Since "D* = { 0 , 2 ,... ,p - 1 }, 


p\r 
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applying Lemma 2.1 we have that 


|/i 2 ?‘(a,r)| = n 

p\r 


Ee 

s&V* 


Sttr 


P 


n 

p\r 


’( 5 ) + ^ 




p\r 


Here, similar to the square case (section 2), we have Pd* = and 0x)*(r) = 

Consequently, using (4.3) we have, similarly to (3.2) and (3.5), that 


(4.3) 

p — 1 


p\r 


q -1 / h 

EE kq{n + m + hi) ... kq{n + m + h^) 
n=0 \m=l 


hP ' 

2 ^( 9 ) 


qP^ 

22 aj(g) 


E 

r\q 

r>l 


riplr 


E 

0 <a<r 


E[ - Ui 5 *(a,r) 


p2 4.(0 , 

E p\2^—]pv*{a,r) 


^ 0(^)2 ^ ^ 
^p|r^ 


/ (a,r)=l 

% 

Pi, P 


/ I O 2 

^ Ar^\2^2uj(r) ^ 1^0 


22aj(g) A(q,\2q^2u){r) Z- 

r\q “ 2 p|7- 


r>l 


^p\r 


2p 


(4.4) 


±1 


Now, for r with an even number of distinct prime factors and || ^ — ||<C 1/h, where 


denotes the distance to the nearest integer, we have 


p\r 


P 




> 


Consequently (4.4) is 


^2 

» 


E 


^ujir)r 


HE 


r 

r>l 

±1 

p\r p 


0(r)^7r^^h) 


ll<iM 


Now let r = piP 2 , with Opj = ^ and —, we have 


2pi 

which implies that (4.4) is 


Pi + 1 P2 - 1 
2 p 2 


> 


2pi 2p2 

qP"^ 


< 


logX 


X 2 




22u}{q) 


h\ 


□ 


Remark 4.1. We picked h = so that the expectation of 
#{m E {n, n + h] : m ^ V* mod p, for all p\q} = h 

p\q 


X^P 


p + 1 _ 

2p ~ 2 Li°g^JlogX 
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is greater than 1. This is important in order to have the possibility of cancellation inside 

9-1 / h 

H + m + hi)... kg{n + m + h,) - 

n=0 \m=l ^ 

We complete this section with the proof of Theorem 1.4. 



Proof of Theorem 1.4. We begin with giving the proof for equation (1.19). Recall that 

^'k,p : 1 < ai, < P — 1 and x +z x~^ < p|, 

If X +z x~^ < p then (p — x) +z (p — > p, Therefore half of the congruence classes 

modulo p contribute to the size of ^'k,p- Also, for y < p we have x +z x~^ = x~^ +zx = y. 
This means that each y G has a double multiplicity, with the exception of y equal to 
1 + 1“^. Considering the fact that for p = 1 mod 4, there exists an x such that x~^ = p — x, 
and therefore x + x~^ = p. This completes the proof of equation (1.19). 


Now let flp := I — a; : a; G {0,1,... ,p — 1} \ ^'k,p\ = Using (1.19) we have 

|f2p| = |p + 0(i). If /cp(m + a;i)/cp(m + a; 2 ) • • • kp{m + uj\Q,^\) = 1, then m G ^K,p- We use (4.2) 
to transform the left hand side of (1.20), and we have 

2 


1 / 

" n=0 \m£[n,n+h] 


p 




K,p\ 


> 


pz 


E 

0<a<p—l 


E 




(4.5) 




K,p 


To hnish the proof of the theorem it is enough to show that ^ hp. Since 

h < pI2 we have that h. For we have 


/^np(l,P) = E 


Recall that Qk,p,^'k,p defined by (1.12) and (1.15). Therefore if —u G kip then oo G 
(^Z/pZ \ klK,p) U : X +2 x~^ > p|. We have 


/^np(Up)= E e 
x£ZIpZ\Uk,p 


-X 


P 


+ E 

ujG{x-\-x ^:x-\-i^x 


p / 




Using Weil’s bound for Kloosterman sums, the first sum above is 0{y/p). For the second 
sum we prove 


-f) = §^ + 0{VP^ogp). 


(4.6) 


E e, 

uj€{x+x ^:x+zX ^>p} 

The following argument for (4.6) was given by Will Sawin and Noam Elkies on Math Overflow 
[2]. The left hand side of (4.6) is equal to 


E ^ )^{x+jy>p}- 


l^x,y<p—l 


p 
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We use a two dimensional Fourier transform to evaluate the left hand side of (4.6). Let A(a, b) 
be the Fourier transform of and B{a,b) be the Fourier transform of 

Then by using Parseval-Plancherel formula, the sum in (4.6) is: 


Eo<a,&<p-i A(a,6).S(a,6) 


where 

A(a,&)= )=F(a,5;p), 

^<x<p P 

B(a,b)= T + 

0 <t^<p P 

x+y>p 


(4.7) 


Note that A(a, b) is the Kloosterman sum unless a = b = 0. For ]3{a, b) when 6 7 ^ 1 we have 


B{a,b) = Y. 


(a — l)x 


l<x<p 


P 


E e 

. p+l—x<y<p—l 


/ {b-l)y \ 

\ p / 


1<X<P 

= E 

l<3;<p 

The hrst term in the latter sum is 


E ' 


P 

e(^) e( 


(a-fe)x+b-l) \ ^ 
P ' 




e( 


6 - 1 ' 


- 1 


P -1 


if a = 1 and 


-1 


e((6-l)/p)-l) - - - - e((6-l)/p)-l 

term in the latter sum is ii a = b and otherwise. 


otherwise. The second 


Note that if a = 6 = 1, then 5(1,1) is {p — l)(p — 2 )/ 2 . Also if 6 = 1 and a ^ 1 we 
have 

Now the main term in (4.7) comes from the contribution of A(0, 0)5(0, 0). The error term can 
be handled by using the Weil bound on A{a, b) for (a, b) 7 ^ ( 0 , 0 ) and the above elementary 
estimates for 5(a, b) for (a, b) A (1, !)• D 


5. Higher central moments of reduced residues modulo q 

In this section we will improve the result in [1] regarding the higher central moments of 
s-tuples of reduced residues. The improvement comes from using Lemma 2.1 to transform 
characteristic functions of s-tuples of reduced residues to an expression in terms of expo¬ 
nential sums. The rest of the proof will follow Montgomery and Vaughan’s [19] arguments 
(Lemma 7 and 8 in [19]). The important part of the proof is to estimate the innermost 
sum in (5.1), which we divide into two cases: diagonal and non-diagonal conhgurations. In 
the diagonal conhguration the estimate derived is good enough for our purposes. In the 
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non-diagonal configuration we use Lemma 7 and 8 in [19] to save a small power of h. Let 
V = {hi,..., hs} be a fixed admissible set. By employing Lemma 2.1 we have that 

k 


q-l / h \ ' 

EE kq{n + m + hi)... kq{n + m + hg) — hPx> j 
n=0 \m=l / 




.|q \i=l ^vi'f'i) J 0<ai<ri \ \ 


'I": 
ri>l 


Vk / 


(ai,ri)=l 

Ek 

4=1 7"^ 


where 


hv{a,r) = \{[ Y. 


p\r \ s=himod p 
hiGV 


sa{r/p)p 

P 


-1 


Let F{x) = min(/i, n^) where ||x|| is the distance between x and the closest integer to x. 
We have that \E{x)\ < F{x). Since |/ix)(a,r)| < we have 


k M{ri) 

M^(q,h)<t:qPiYl E IfxTrr 

r\q nlr i=l r) o<ai<ri 

ri>l {ai,ri)=l 

[ri,...r^.]=r 


- '(? 


'(S' 


(5,1) 


Proof of Theorem 1.5. We use the method in [19] to bound 




0<ai<ri 

(a,i,ri)=i 

yk ^ 


\rk, 


in (5.1). First we focus on diagonal configuration i.e. ri = r 2 ,r 3 = r 4 ,... ,rk-i = and 
r 2 , r 4 ,..., are relativity co-prime. In the diagonal configuration we have that 




0<ai<ri 
{ai,ri)=l 
yk fii 2 

Z_^ 4 =l r,' 


\rk 


0<ai<ri 


\ri 


0<afe_i<rj._ 


fI 

1 Vk-1 


< rira ... rk-ih’"^'^ = [rirg ... 

Consequently, the contribution of the the diagonal configuration in (5.1) is less than 


qPiY. E 


r. 


r\q [ri,...,rfe]=r 
(r-2i_i,r-2j-i)=l 

i¥^j 


g2aj(ri) 

(/>D(n)^ 


rs 


g2uj(r3) 




=<iPvi: 


(g 2 k\uj{r)' 


r\q \ Mry 


h^/"^ = qPJ^l[il + 


ps^^ 


p\q 


{p-Up{V) 


0c(r)2 
^1 


(5.2) 
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In (5.2) we used the fact that the number of fc-tuples (ri,..., r^) with [ri,..., r^] = r such 
that each p divides exactly two of Uj is less than (/c/2)“h) (see [18]). In the non-diagonal 
configuration Lemma 7 in [19] allows ns to save a small power of h. Now we state the Lemma 
7 in [19] and explain how it should be apply. Our aim is to get the following 

M^{q, h) < (^1 + . (5.3) 

This bound is analogous to [19, Lemma 8 ] and its proof is nearly identical. The key difference 
is in (5.1) we have instead of l/0(r). Our main tool is the following lemma. 


Lemma 5.1 (Montgomery and Vaughan). For k > 3, let ri,... ,rk be square free numbers 
with r* > 1 for all i. Further let r = [ri, r 2 ,..., n^], d = {ri, r 2 ), ri = dr[,r 2 = dr'^. and write 
d = st where s|r 3 ... (t, r^r^ ... r^) = 1. Then 


where 


and 


Y, F f ^y .. F f ^ « r 1... rfcr -1 (Ti + T2 + Ta + T4 ) 

Q<a^<n VF/ Vk) 


Ti = h-i/20. 

T 2 = d~^^‘^when f > h^^^for all i, 

T 2 = 0 otherwise; 

T 3 = s~^^'^when f > h^^^ for all iand ri = r 2 , 
Ts = 0 otherwise; 


T 4 = 


( —^— E 

\rir2sh^ 



when h®/® < ri < h'^ for i = l,2,t > d^^'^and d < h^^^, 
T 4 = 0 otherwise. 


(5.4) 


We shall also use the following estimate [19, Lemma 1] 

Y <fk\ 1 


E 


0<ai<ri 

(ai,ri)=l 

yh ^ 



...F 




f'k. 



(5.5) 


2 = 1 


Now we explain how to choose ri,r 2 in order to apply Lemma 5.1. Note that we only need 
to consider those /c-tuples r = (ri, r 2 ,,..., r^) for which f > 1 , [ri,..., rfc] = r, and each prime 
divisor of r divides at least two of the f, since otherwise the sum on the left hand side of 
(5.4) is empty. If f < h®/® for some z, then by using (5.5) and [19, Lemma 4] we have our 
desired result. Now suppose that f > h®/® for all z, and set dij = (ri,rj). For each i we can 
hnd a j, such that 

dij > 

19 


(5.6) 





If there is a pair {i,j) for which this holds and r* 7 ^ rj, then in Lemma 5.1 we choose these 
to be ri,r 2 . We note that if r* = ri then dij = > h®/®, and (5.6) holds. Suppose now 

that (5.6) holds only when = rj. If there is a triple {i,j, k) such that r* = ri = then 
we apply Lemma 5.1 with ri,rj as ri,r 2 . Otherwise the r* are equal in distinct pairs, say 
ri = r 2 , ra = r 4 ,..., rk_i = r^, and k is even. Let v be the product of all those prime factors 
of r which divide more than one of the numbers r 2 ,r 4 ,r 6 ,... ,rfc. Then there exists i such 
that 


k2*,noi > 

\ j^i / 

In this case we take ri and r 2 to be r 2 j_i,r 2 j and by employing Lemma 5.1 we have 




r\q 


ri\r 

ri>l 

[ri,...rfe]=r 


=1 (l^viri) 


^(Ti+T2+T3 + T4). 


(5.7) 


(5.8) 


(5.9) 


Note that if any of T 2 , T 3 , or T 4 is non-zero then d > 9)_ The contribution of Ti to 

(5.8) is 

< (1 + ^psk-{s+l)>‘f^-l/ 20 _ 

p\q \ P / 

By the selection of ri, r 2 we have that if T 2 7 ^ 0 then d > Therefore the contribution 

of T 2 to (5.8) is <C Now for T 3 we have ri = r 2 and ri > h®/®. If 

ri = r 2 = Ti for some i > 2, then s = r > h®/®, so that T 3 < Ti and therefore the 
contribution of such T 3 to (5.8) is smaller than Ti . It remains to consider the case when 
ri = r 2 ,r 3 = r 4 , ...,rk-i = Vk- Let r = uv where u is the product of those primes dividing 
exactly one of ^ 2 , r 4 ,..., r^. Then each prime divisor of v divides two or more of the r 2 i- 
By our choice of ri,r 2 we have s > . Put = UiVi where Ui = {ri,u) and Vi = {ri,v). 

Suppose that u and v are hxed, and let C{u, v) denote the set of (ri,..., r^) of the sort under 
consideration. We have \C{u,v)\ < dk/ 2 {u)d{v)’^^'^. Using the change of variable r = uv and 
by rearranging the sum in (5.8), for the contribution of T 3 we have 


E— E 

uv\q {ri,...,rk)&C{u,v) 




I <l>T>{v) 


UV 


1+2/fc 


(5,10) 




p\q 


2(j)l{p) 


P 


1+2/fc 


< P-—. 

For the contribution of T4, by the Cauchy inequality, we have 

2 \ 1/2 


1 k a^(.ri) / 1 / Mn) 

T.- E n 7 - 77 fr 4 « E; E 117-77/ 

r\q ri\r i=l */ y r\q V 4 ^T>\ i) 

ri>l 

[ri,---rk]=r 


1/2 


E) E n 

r\q n,...+fe 


(5.11) 
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The first factor on the right is made larger as it runs over all fc-tuples for which ri\r. The 
larger expression is 



(5,12) 


The second factor has been treated precisely in [19, pp. 324-325] and it is smaller than 
j^-2/7k^ By combining (5.9), (5.10), (5.11) and (5.12) we complete the proof of (5.3). Note 
that we have just sketched the key ideas of the proof, the interested reader can find further 
details in [19]. To finish the proof of the Theorem 1.5 we appeal to Lemma 3.1 in [1]. Let 

Qi = Ylp\q P and q 2 = Ilpig P, where y > . We set Pi = for i = 1, 2. Then [1, Lemma 

p<y p>y 

3.1] states that 

(g, h) < g(hP")I^/2] ^ p qh^Ppsk_ 


This lemma is obtained by combining two different estimates of M^{q,h): an exponential 
estimate and a probabilistic estimate. The exponential estimate stated in [1, Lemma 1.2] 
gives 

M®(g, h) < 

Here we use the estimate (5.3), instead of the above estimate and we derive: 

M^{q, h) g(hP")['=/2] + qhP^ + (l + 

Now by considering y = we have (1.25) and for h < eLPT^, we have (1.24), which com¬ 
pletes the proof. □ 
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